Automatic Evaluation of Spoken Dialogue Systems

نویسندگان

  • Wieland Eckert
  • Esther Levin
  • Roberto Pieraccini
چکیده

We advocate an objective evaluation methodology for the automated evaluation of spoken dialogue systems that eliminates manual interaction and reduces annotation errors and personal bias. The evaluation is performed by observing interactions between the system and a simulated user. We argue that user simulation is an inexpensive and feasible method for optimizing a dialogue system in the lab. Using a simulated user we can conduct dialogues until the performance measure reaches a predetermined conndence level. A simulated user not only exercises the dialogue system and points out defects, it also helps predict the success of a modiied dialogue strategy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Galaxy-II as an Architecture for Spoken Dialogue Evaluation

The GALAXY-II architecture, comprised of a centralized hub mediating the interaction among a suite of human language technology servers, provides both a useful tool for implementing systems and also a streamlined way of configuring the evaluation of these systems. In this paper, we discuss our ongoing efforts in evaluation of spoken dialogue systems, with particular attention to the way in whic...

متن کامل

Evaluating Automatic Dialogue Strategy Adaptation for a Spoken Dialogue System

In this paper, we describe an empirical evaluation of an adaptive mixed initiative spoken dialogue system. We conducted two sets of experiments to evaluate the mixed initiative and automatic adaptation aspects of the system, and analyzed the resulting dialogues along three dimensions: performance factors, discourse features, and initiative distribution. Our results show that 1) both the mixed i...

متن کامل

Quality of telephone based spoken dialogue systems pdf

Quality of Telephone-Based Spoken Dialogue Systems. Quality of Human-Machine Interaction over the Phone.Quality prediction models for telephone-based spoken dialogue systems. Perform to guarantee an acceptable overall quality for the user. In order to.action HHI as one reference for telephonebased humanmachine interaction HMI. The quality of interactions with spoken dialogue systems is difficul...

متن کامل

Task Complexity measurement in the evaluation of Spoken Dialogue Systems

Spoken Dialogue Systems (SDS ́s) is a technology that provides users with a speech based interaction with machines to carry out several activities in the pursuit of their goals (information retrieval, business transactions, automatic translation, etc.). Development of this kind of systems needs an evaluation process to show the real system performance, to identify implementation errors, etc. The...

متن کامل

Speech recognition performance and learning in spoken dialogue tutoring

Speech recognition errors have been shown to negatively correlate with user satisfaction in evaluations of task-oriented spoken dialogue systems. In the domain of tutorial dialogue systems, however, where the primary evaluation metric is student learning, there has been little investigation of whether speech recognition errors also negatively correlate with learning. In this paper we examine co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998